Modeling the Webgraph Evolution with Graph Grammars
Abstract
The webgraph is the graph induced by the link structure of web pages: each node represents a web page and each edge is a hyperlink from one page to another. The webgraph has a particular structure, unlike that of other known graphs, and it grows continuously. Nowadays it contains more than 25 billion nodes and about 400 billion edges. Even though some properties of this graph have changed over time, some basic characteristics remain the same. One of them is the power law distribution of the indegree of nodes. Under this distribution, the probability that a node $u$ is linked to by $k$ other nodes is proportional to $k^{-\beta}$, that is, $\Pr_u[\mathrm{IN}(u) = k] \sim 1/k^{\beta}$. The usual value in webgraphs is $\beta \approx 2.1$.

Surprisingly, the power law distribution also appears when other properties of this graph are analyzed. In 2000, Broder et al. [2] showed that both the indegree and the outdegree of the webgraph follow a power law distribution. In 2002, Pandurangan et al. [4] observed that the PageRank also follows a power law distribution. In 2004, Donato et al. [3] reported a power law in the distribution of the strongly connected components.

Based on some of these empirical observations in real-world webgraphs, models for generating synthetic webgraphs have been proposed in the last few years. In all of them, the power law of the indegree distribution is the first property to be verified. A survey of models for generating webgraphs can be found in [1]. Graph grammars provide a formal way to generate graphs, based on the definition of rules that govern the evolution of the graphs. In this paper, we investigate the suitability of graph grammars for generating and analyzing the webgraph. The idea is to use properties that are observed in webgraphs and create rules that preserve these properties. That is, we create a grammar that
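As an illustration only, and not the grammar proposed in the paper, the idea of a growth rule that yields a heavy-tailed indegree distribution can be sketched in Python. The rule below is a hypothetical one chosen for simplicity: each step adds a node with a single outgoing edge whose target is picked uniformly with probability `alpha` and in proportion to current indegree otherwise. The names `grow_webgraph`, `indegree_histogram`, `alpha`, and `seed` are assumptions introduced for this sketch.

```python
import random
from collections import Counter


def grow_webgraph(n_nodes, alpha=0.1, seed=0):
    """Grow a directed graph by repeatedly applying one hypothetical rule.

    Rule (an assumption, not taken from the paper): add a new node v with
    one outgoing edge v -> t. With probability alpha, t is a uniformly
    random existing node; otherwise t is the target of a uniformly random
    existing edge, which favors nodes in proportion to their indegree.
    Assumes n_nodes >= 2. Returns the list of indegrees, indexed by node.
    """
    rng = random.Random(seed)
    targets = [0]      # targets[i] is the head of edge i; seed edge is 1 -> 0
    indegree = [1, 0]  # indegrees of the two seed nodes 0 and 1
    for v in range(2, n_nodes):
        if rng.random() < alpha:
            t = rng.randrange(v)     # uniform over existing nodes 0..v-1
        else:
            t = rng.choice(targets)  # indegree-proportional choice
        targets.append(t)
        indegree.append(0)           # the new node starts with indegree 0
        indegree[t] += 1
    return indegree


def indegree_histogram(indegree):
    """Map each indegree k to the number of nodes u with IN(u) = k."""
    return Counter(indegree)


indegree = grow_webgraph(20000)
hist = indegree_histogram(indegree)
# Empirical estimate of Pr[IN(u) = k] for the five smallest observed k.
for k in sorted(hist)[:5]:
    print(k, hist[k] / len(indegree))
```

Plotting `hist` on log-log axes is one way to check whether the tail looks roughly linear, as a power law would suggest. Raising `alpha` makes target selection more uniform and should flatten the tail.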
Similar resources
Modeling with Graph Transformations
In this paper we give a general overview of graph grammars and graph transformation as important modeling techniques for several areas in computer science. In particular we present the main ideas of the algebraic approach, which is the basis for a categorical theory of rule-based transformations of high-level structures. 1 General Overview of Graph Transformation The research area of graph gram...
Modeling Web Evolution
The Web is the largest human information construct in history, and it is transforming our society. How can we understand, measure and model the Web evolution in order to design effective policies and optimize its social benefit? Early measurements of the Internet traffic and the Web graph indicated the scale-free structure of the Web and other Complex Networks. Going a step further Kouroupas, Koutsoupias,...
Graph Structure and Evolution of the Korea web
The study of the web graph yields valuable insight into web algorithms for crawling, searching and community discovery, and into the sociological phenomena that characterize its evolution. It is also useful for understanding the evolution process of the web graph and predicting the scale of the Web. In this paper, we report experimental results on properties of the Korea web graph with over 116 million...
Attributed Context-Sensitive Graph Grammars
The paper introduces a concept of attributed context-sensitive graph grammars. The graph grammars are a graphical generalization of the textual grammars and can thus be used to specify the syntax of graphical programming or modeling languages. The attributed graph grammars extend the basic graph grammars with definitions of attributes and the associated attribute evaluation rules. By analogy to...
Modeling Railway Control Systems Using Graph Grammars: A Case Study
In this report, we develop a railway control system. We are mainly concerned with the software architecture of the control system and its dynamic evolution; we do not discuss here the implementation details of the components forming the control system. The software architecture is characterized by a hierarchy of controllers whose leaves are local controllers connected in a network that mimics t...